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Abstract. We generalize the IP-polynomial Szemeredi theorem due to Bergelson and 
McCutcheon and the nilpotent Szemeredi theorem due to Leibman. Important tools in 
our proof include a generalization of Leibman's result that polynomial mappings into 
a nilpotent group form a group and a multiparameter nilpotent Hales-Jewett theorem. 



1. Introduction 

Furstenberg's ergodic theoretic proof ||Fur77ll of Szemeredi's theorem on arith- 
metic progressions HSze75|| has led to various generalizations of the latter. Recall that 
Furstenberg's original multiple recurrence theorem provides a syndetic set of return 
times. The IP recurrence theorem of Furstenberg and Katznelson [FK85], among other 
things, improves this to an IP* set. The idea to consider the limit behavior of a mul- 
ticorrelation sequence not along a Folner sequence but along an IP-ring has proved to 
be very fruitful and allowed them to obtain the density Hales-Jewett theorem HFK91II . 

In a different direction, Bergelson and Leibman [BL96] proved a polynomial mul- 
tiple recurrence theorem that has later been extended from commutative to nilpotent 
groups of transformations by Leibman ||Lei98|| . Many of the additional difficulties in- 
volved in this extension were algebraic in nature and have led Leibman to establish a 
general theory of polynomial mappings into nilpotent groups [Lei02]. An important 
aspect of the proofs of these polynomial recurrence theorems, being present in all later 
extensions including the present article, is that the induction process involves "multi- 
parameter" recurrence even if one is ultimately only interested in the "one-parameter" 
case. 

More recently an effort has been undertaken to combine these two directions. Build- 
ing on their earlier joint work with Furstenberg [ BFM96 ] , Bergelson and McCutcheon 
[BMOO] obtained results that allowed to replace the syndetic set of return times in the 
polynomial multiple recurrence theorem by an IP*-set but did not fully encompass the 
IP multiple recurrence theorem of Furstenberg and Katznelson. This has been amended 
by Bergelson, Haland Knutson and McCutcheon for single recurrence [BHKM06] and 
McCutcheon for multiple recurrence [McC05] (the results of the latter paper also pro- 
vide multiple recurrence along certain generalized polynomials). 

In this article we continue this line of investigation. Our main result, Theorem l5.32l 
generalizes McCutcheon's IP polynomial multiple recurrence theorem to the nilpotent 
setting. Its significance is best illustrated by the following generalization of Leibman's 
nilpotent multiple recurrence theorem (here and throughout the article group actions 
on topological spaces and measure spaces are on the right and on function spaces on 
the left.). 

Theorem 1.1. Let T 1 ,.,.,T t be invertible measure-preserving transformations on a prob- 
ability space {X,j4,h) that generate a nilpotent group. Then for every A e j4 with 
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/i(A) > 0, any m e N and any polynomials p^j e 1\X\, . . . ,X m ], i = 1, . . t, j = 1, . . .,s, 
that vanish at the set 



is IP* in 7L m . 

While the proof strategy is basically the same as in the article of Bergelson and Mc- 
Cutcheon [ BM00 ] there are several new challenges at the technical level. Let us give 
a brief outline. In Section[2]we set up the algebraic framework: IP-polynomials in one 
and several variables with values in a nilpotent group. In Section[3]we prove a stronger 
version of the nilpotent Hales-Jewett theorem due to Bergelson and Leibman (Theo- 
rem [33]). This allows us to deduce a multiparameter version (Theorem 13.61) . That 
version is ultimately applied to polynomial-valued polynomials mappings, see Corol- 
lary |3.8l In Section|4]we define the nilpotent analogue of FVTP systems and prove that 
unitary representations of such systems IP-converge to projections, see Theorem |4.2| 
(this suffices to obtain single recurrence). Here we use the concept of Hirsch length 
that generalizes the notion of the rank of a free Z-module. Finally, in Section[5jwe use 
these preliminary results to prove the multiple recurrence theorem [5.26l The a priori 
restriction to regular measure spaces and certain countable groups of polynomials in 
several variables can be easily removed, and this yields our main result. 



In this section we set up the algebraic framework for dealing with IP-polynomials in 
several variables with values in a nilpotent group. 

We begin with a generalization of Leibman's result that polynomial mappings into 
a nilpotent group form a group under pointwise operations IILei02l Proposition 3.7 
and erratum]. Following an idea from the proof of that result by Green and Tao 
IIGT121 Proposition 6.5] we encode the information that is contained in Leibman's 
vector degree in a prefiltration indexed by N = {0, 1, ... } (see IIGTZ101 Appendix B] 
for related results regarding prefiltrations indexed by more general partially ordered 
semigroups). 

A prefiltration G. is a sequence of nested groups 

(2.1) G >G 1 >G 2 >... such that [G u Gj] c G i+j for every i,;' e N. 

A filtration (on a group G) is a prefiltration in which G = G 1 (and G = G). We will 
frequently write G instead of G . Conversely, most groups G that we consider in this 
article are provided with a prefiltration G. with G = G. A group may admit several 
prefiltrations, and we usually fix one of them even if we do not refer to it explicitly. 

A prefiltration is said to have length d e N if G d+1 is the trivial group and length 
—oo if G is the trivial group. Arithmetic for lengths is defined in the same way as 
conventionally done for degrees of polynomials, i.e. d — t = — oo if d < t. 

Let G. be a prefiltration of length d and teN. We denote by G. +t the prefiltration 
of length d — t given by {G. +t ) t = G i+t and by G./ t the prefiltration of length min(d, t) 
given by G i/t = GjG t (this is understood to be the trivial group for i > t; note that 
G t is normal in each G i for i < t by ( 12. ID ). These two operations on prefiltrations 
can be combined: we denote by G,/ t+s the prefiltration given by G ; / £+s = G i+ jG t , it 
can be obtained applying first the operation / 1 and then the operation +5 (hence the 
notation). 

If G is a nilpotent group then the lower central series is a filtration. More generally, 
if G 1 < G is a normal subgroup then G i+1 = [G,, G x ] defines a prefiltration (that has 
finite length if and only if G 1 is nilpotent). If G. is a prefiltration and d = (d,-) ieN c N is 
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a superadditive sequence (i.e. d i+j > d ; + dj for all i,j e N; by convention d_ x = — oo) 
then G,, defined by 

(2.2) Gf = Gj whenever < i < dj, 

is again a prefiltration. 

We define G. -polynomial maps by induction on the length of the prefiltration. 

Definition 2.3. Let r be any set and be a set of partially defined maps T : r d 
dom(T) -» T. Let G. be a prefiltration of length d e {-oo} UN. A map g: r -» G 
is called G.-polynomial (w.r.t. £7") if either d = — oo (so that g identically equals the 
identity) or for every Te^ there exists a G. +1 -polynomial map D T g such that 

(2.4) D T g = g~ 1 Tg :=g-\g°T) on domT. 

We write P(T, G.) for the set of G.-polynomial maps, usually suppressing any reference 
to the set of maps ST that will remain fixed for most of the article. 

Informally, a map g : T — » G is polynomial if every discrete derivative D T g is 
polynomial "of lower degree" (the "degree" of a G.-polynomial would be the length of 
the prefiltration G„ but we prefer not to use this notion since it is necessary to keep 
track of the prefiltration G. anyway). The connection with Leibman's notion of vector 
degree is provided by J2.2D : a map has vector degree d w.r.t. a prefiltration G. if and 
only if it is Gf -polynomial. 

Note that if g is G.-polynomial then gG t is G./ t -polynomial for any t e N (but 
not conversely). We abuse the notation by saying that g is G./ t -polynomial if gG t is 
G./ t -polynomial. In assertions that hold for all T e & we omit the subscript in D T . 

The next theorem is the basic result about G. -polynomials. 

Theorem 2.5. For every prefiltration G. of length d e {— oo} UN the following holds. 

(1) Let t; e N and g ; : T — » G he maps such that g ; is G.^ d+1 _ t _> +t -polynomial for 
i = 0, 1. Then the commutator [go,gi] is G. +to+ti -polynomial. 

(2) Let go, g x : Y —> G be G.-polynomial maps. Then the product g gi is also G.- 
polynomial. 

(3) Let g: V —> G be a G.-polynomial map. Then its pointwise inverse g~ l is also 
G.-polynomial. 

Proof. We use induction on d. If d = — oo then the group G is trivial and all assertions 
hold trivially. Let d > and assume that the assertion holds for all smaller values of d. 

We prove part (TTJ using descending induction on t = t + t x . We clearly have 
[&o> Si] c G t . If t > d + 1 there is nothing left to show. Otherwise it remains to show 
that £>[g )&i] is G. +t+1 -polynomial. For this end use the commutator identity 

(2.6) D[g , gl ] = [g ,D gl ] ■ [[g ,D gl ], [g ,gi]] 

■ l[go,gi],Dgi] ■ [[g ,giDgi],Dg ] ■ [Dg^g^gj]. 

We will show that the second to last term is G. +t+1 -polynomial, the argument for the 
other terms is similar. Note that Dg is G./( d+1 _ ti ) +to+1 -polynomial. By inner induc- 
tion hypothesis it suffices to show that [g ,giD g x ] is G./( d _ t ) +tl -polynomial. But the 
prefiltration G./y_ t j has smaller length than G., and by the outer induction hypoth- 
esis we can conclude that gi£>gi is G.^ d _ t -polynomial. Moreover g is clearly 
G./( d _ to _ t j-polynomial, and by the outer induction hypothesis their commutator is 
G. /(d _ t )+ti -polynomial again. 

Provided that each multiplicand in (12.66 is G. +t+1 -polynomial we can conclude that 
D[go> gi\ is G. +t+1 -polynomial by the outer induction hypothesis. 

Part ([2]) follows immediately by the Leibniz rule 

(2-7) D(g ogl ) = Dg [Dg ,g 1 ]Dg l 
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from (Q~J) with t Q = 1, t 1 = and the induction hypothesis. 
To prove part ([3]) notice that 

(2.8) Dig- 1 ) = gtDgT'g- 1 = [g _1 ,Dg](Dg) _1 . 

By the induction hypothesis the map g -1 is G./ d -polynomial, the map Dg is G. +1 - 
polynomial and the map {Dg)~ l is G. +1 -polynomial. Thus also D{g~ l ) is G. +1 -polynomial 
by OJ and the induction hypothesis. □ 

Discarding some technical information that was necessary for the inductive proof 
we can write the above theorem succinctly as follows. 

Corollary 2.9. let G. be a prefiltration of length d. Then the set P(T,G.) of G.- 

polynomials on F is a group under pointwise operations and admits a canonical pre- 
filtration of length d given by 

P(T, G.) > P(T, G. +1 ) > > P(T, G. +d+1 ). 

Clearly every subgroup F < P(r, G.) admits a canonical prefiltration F. given by 

(2.10) F t = F n P(r, G. +i ). 

Remark 2.11. If V is a group and 

(2.12) ST = {T b : n ~ nb,dom(T b ) = F, where b e r} 

we recover [LeiC^, Proposition 3.7]. 
Example 2.13. If r is a group and 

(2.14) & = {T aJ , : T T, n anb, where a, b e r} 

then every group homomorphism g : r — > G 1 is polynomial. In particular, every homo- 
morphism to a nilpotent group is polynomial with respect to the lower central series. 

This can be seen by induction on the length d of the prefiltration G. as follows. If 
d = — oo there is nothing to show. If d > write 

(2.15) D T a , b g(n) = g{nT l g{anb) = [g(n), gta^Mab). 

By the induction hypothesis gG d is G./ d -polynomial and the constant maps g(a) _1 , 
g[ab) are G. +1 -polynomial since they take values in G 1 . Hence D T ^ b g is G. +1 -polynomial 
by Theorem [23] 

We will encounter further concrete examples of polynomials in Proposition |2.24l and 
Lemma [4722] 

IP-polynomials. In this article we are interested in the case T = where ^ is 
the partial semigroup^ of finite subsets of N with the operation a* [3 = au that is 
only defined if a and /3 are disjoint. It is partially ordered by a < [3 if and only if 
max a < min/3 (note that in particular < a and a < for any a e ^ ). 
The set is then given by 

(2.16) ST = {T a : p -> a * /3,dom(T a ) = {j3 : a n ft = 0}, where a e r}. 

If T = T a then we also write D a instead of D Ta . We write P o (^0» G.) f° r tne subgroup 
of polynomials that vanish at 0. For every g e P (^b> G.) an d ^ e ^ we have 

(2.17) g(/3) = g(0)D i3 g(0)GG 1 . 
Therefore the symmetric derivative D, defined by 

(2.18) b p g{a) = DpgtaMPT 1 = g(aT l g(a u /3)g(/3)" 1 , 

1 A partial semigroup [ BBH94] is a set F together with a partially defined operation *:Txr->r that is 
associative in the sense that (a* b)* c = a*(b *c) whenever both sides are defined. 
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maps P o (^0> G.) i nt0 foC^0> G. +1 ). Moreover P o C^0, G.) admits the canonical prefiltra- 
tion of length d — 1 given by 

P C% G.) > P o (J*0, G. +1 ) > > P o (jr , G. +d ). 

The group P o (^0>G.) can also be alternatively characterized by P o (^0>G.) = {1 G } if 
d = —oo,0 and 

g e P C% G.) ^ g:&^ G 1 and VfJDpg e P C% G. +1 ). 

Polynomials in several variables. As we have already mentioned in the introduction, 
the inductive procedure that has been so far utilized in all polynomial extensions of 
Szemeredi's theorem inherently relies on polynomials in several variables. We find it 
more convenient to define polynomials of m variables not on <^ m , but rather on the 
subset (J^)™ c J^ m that consists of ordered tuples, i.e. 

= K«i, • • • , O e : a x < ■ ■ ■ < aj. 

Analogously, (^0)" is the set of infinite increasing sequences in We will frequently 
denote elements of (.^0)™ or (^0)" by a = (a ls a 2 , . . . ). 

Definition 2.19. Let G. be a prefiltration and F < P o (^0,G.) a subgroup. We define 
the set F® m of polynomial expressions in m variables by induction on m as follows. We 
set F®° = {1 G } and we let F® m+1 be the set of functions g : ( J" )™ +1 -» G such that 

g(aj, a m+l ) = W a "-' a ">(a m+1 )S(a 1 , . . . , aj, 

where S e F® m and W" 1 '--"- e F for every a 2 < • ■ • < a m . 

Note thatF® 1 =F. 

Lemma 2.20. Suppose that F is invariant under conjugation by constant functions. Then, 
for each m, the set F® m is a group under pointwise operations and admits a canonical 
prefiltration given by (F® m ), = (F ; )® m 

IfK<Fisa subgroup that is invariant under conjugation by constant functions then 

R ®m < F ®m ig aho a subgroup 

Proof. We use induction on m. For m = there is nothing to show. Let 

R } e (F tj f m+1 : (a 1( . . . , a m+1 ) - W/ 1 -' a "Ca m+1 )S/o 1) . . . , a m ), j = 0,l 

be polynomial expressions in m + 1 variables. Suppressing the variables a ls . . . , a m we 
have 

^ ' 

€F 

so that RqP^ 1 e F® m+1 . Hence F® m+1 is a group. 

In order to show that (F.)® m+1 j s indeed a prefiltration we have to verify that 

[RM = [WbSo.WiSi] e(F to+ti ) m+1 . 

This follows from the identity 

[xy,uv] = [x,u] [x,v][[x,v], [x,u]] [[x,u], v] 

• [[x,v][x,u][[x,u],v],y] 

• [j,v],[y,u][[y,u],v]. 

It is clear that K® m < F® m is a subgroup provided that both sets are groups. □ 

For every m e N there is a canonical embedding F® m < F® m+1 that forgets the last 
variable. Thus we can talk about 

F®" := injlimF® m = U m€N F® m , 

meN 

this is a group of maps defined on (J^)" with prefiltration (F®") ; = (F;)®". 
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Polynomial-valued polynomials. 

Definition 2.21. Let G. be a filtration of length d. A VIP group is a subgroup F < 
P o (^0, G.) that is closed under conjugation by constant functions and under D in the 
sense that for every jeF and a e & t the symmetric derivative D a g lies in F x (defined 
in dZTOt "). 

As an example, the group P o (^0> G.) itself is VIE 

Proposition 2.22. Let F < P {^,G.) be a VIP group. Then for every g e F® m the 
substitution map 

(2.23) h:0 = (ft, . . . , ft) e (J? )™ - (g[ft : a e (^,)« - g(U, eft a u U i6fc a ; )) 
iies mP C%F^f m . 

Proo/ We proceed by induction on m. In case m = there is nothing to show, so 
suppose that the assertion is known for m and consider g e i7® m+1 . By definition we 
have 

g{a x , a m+1 ) = W ai -- a '"(a m+1 )S(a 1 , . . . , a m ) 

and 

KPi, ft+i)(5) = W W-- u ^ [ft +1 ](a)S[ft, . . . ,ft](5). 
In view of the induction hypothesis it remains to verify that the map 

h : /3 ~ (a ~ w Ufe ft a ""- Ufe ft» ai [ft] (a)), /3 > ft > • • • > ft 

is in P o (J r ,F 8 " u ). The fact that h(ft) e F®" for all /3 follows by induction on |ft using 
the identity 

w u ieft a i ,...,u ieft „a i |-^ u { b j]( 5 ) = 

w u ieft o,,..,^ ( 0i )fi w u ^i a '-- u *fa a ' (a b ) W u '-ft "i.-Mi*. [ft] (5) 

that holds whenever b > fi > ft > ■ ■ • > ft . In order to see that ft is polynomial in fi 
observe that 

b r h{l3) : a -» D Ufe] . ai W u -ft a '-- u ^ a '(U ie/3 a,), /3 > r > ft > • • • > ft . □ 

Monomial mappings. In this section we verify that monomial mappings into nilpotent 
groups in the sense of Bergelson and Leibman [BL03 , §1.3] are polynomial in the sense 
of Definition 1231 

For a sequence of finite sets R = (R Q ,R 1 , . . . ) only finitely many of which are non- 
empty and a set a write 

R[a] := a x R i+i a 1 x R 1 l±l . . . 

Here the symbol i+i denotes disjoint union and a 1 are powers of the set a (note that a 
consists of one element, the empty tuple). 

Proposition 2.24. Let G. be a prefiltration of length d and N c N any subset. Let 
g. : R[N] —* G, x — » g x be a mapping such that g.(AP x R ; ) c G, for every ieN and -< 
be any linear ordering on R[N]. Then the map 

g: &{N)^G, a~ ]~[ gj 

j€R[a] 

is G,-polynomial on the partial semigroup &{N). 
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Proof. We induct on the length of the prefiltration G.. If d = — oo then there is nothing 
to prove. Otherwise let fi e J^fJV). We have to show that Dpg is G. +1 -polynomial. 

Let B c R[N] be a finite set and AcB. By induction on the length of an initial 
segment of A (that proceeds by pulling the terms g-, j e A, out of the double product 
one by one, leaving commutators behind) we see that 

-< -< -< ^—lexicographic 

( 225 ) nu=nu n n 

j&B j€A jeB\A k€A^ d 

where A- d is the set of all tuples of elements of A with at most d coordinates in JV and 



ij> Sj,(k ,...,k t ) ■ 



[Sj,(k ,...,k i -i)' Sk t ] if j -< K -< — < K, 

1 otherwise. 



Let a e J^(JV) be disjoint from /3. Applying A2.25D with A :=R[a] andB :=R[aUl3] 
we obtain 

-< ^—lexicographic 

Dpg{a)= \\ 11 &i*> 



where gj ) (k ,..^k i ) e ^h+l +•••+! provided that j e a' x R ( and fc e a' x R ; ,...,7^ e 
a'' Xi^. 

The double product can be rewritten as 0(es[al ^ ^ or some sequence of finite sets 
S, an ordering -<' on S[N'], where N' = N\[3, and h. : S[iV'] -» G. The sequence of 
sets S is obtained by the requirement 

(R[aU|5] \R[a]) xR[a]- d =S[a] 

for every a c JV'. The lexicographic ordering on [R[N] \ R[N']) x R[JV']- d induces an 
ordering -<' on S [JV'] . Define h z = g if fj, fc) corresponds tozeS [JV'] . 

By construction we have h.((JV')' x S t ) c G I+1 since each element of R[N] \ R[N'] 
has at least one coordinate in JV but not JV'. Thus Dpg is G. +1 -polynomial by the 
induction hypothesis. □ 

Corollary 2.26. Let G be a nilpotent group with lower central series 

G = G = G 1 >--->G S >G S+1 = {1 G }, 

Zet g. : N d —> G be an arbitrary mapping and -< be any linear ordering on N d . Then the 
map 

-< 

g:J?(JV)^G, a^n^ 

j€a d 

is polynomial on the partial semigroup &(N) with respect to the filtration 
(2.27) G > G 1 > ■ • ■ > G x > •■• > G s > ■ • ■ > G S >G S+1 . 

d times d times 

3. Nilpotent Hales-Jewett 

In this section we refine the nilpotent Hales-Jewett theorem due to Bergelson and 
Leibman [BL03, Theorem 0.19] using the induction scheme from [BL99, Theorem 
3.4]. This refinement enables us to deduce a multidimensional nilpotent Hales-Jewett 
theorem that is required in the proof of our measure-theoretical recurrence result. 
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PET induction. First we describe the PET (polynomial exhaustion technique) induc- 
tion scheme ||Ber87ll . For a polynomial g e P o (^0, G.) define its level Z(g) as the great- 
est integer Z such that g e P o (^0> We define an equivalence relation on the set 
of non-zero G. -polynomials by g ~ h if and only if Z(g) = 1(h) < Z(g _1 /i). Transitivity 
and symmetry of ~ follow from Theorem 12.5 1 

Definition 3.1. A system is a finite subset A c P O (^ ,G.). The weight vector of a system 
A is the function 

I >-» the number of equivalence classes modulo ~ of level Z in A. 

We order the weight vectors lexicographically. 

Proposition 3.2. Let Abe a system, h&Abe a mapping of maximal level and B c G 1; 
M c ^1 k /irate sets. Then the weight vector of the system 

A" = {bh-'gD.gb- 1 , g£A,atM,b£B}\ {1 G } 

precedes the weight vector of A 

Proof. We claim first that the weight vector of the system 

A = {h- l gD a g, aeM,geA}\{l G } 

precedes the weight vector of A. Indeed, if Z(g) < 1(h) then g ~ h~ 1 gD a g. If 
Kg) = Kg) = Kh) and g ~ g / h then /i _1 g£' a g ~ h _1 gD a g. Finally if g ~ h then 
l(h~ 1 gb a g) > 1(h). Thus the weight vector of A' does not differ from the weight of 
vector of A before the Z(h)-th position and is strictly smaller at the Z(h)-th position, as 
required. 

We now claim that the weight vector of the system 

A" = {bgb-\ geA'.beB} 
coincides with the weight vector of the system A'. Indeed, this follows directly from 

bgb- l =g[g,b-^~g. D 

Nilpotent Hales-Jewett. The following refined version of the nilpotent Hales-Jewett 
theorem [BL03, Theorem 0.19] does not only guarantee the existence of a "recurrent" 
point but also allows it to be chosen from a finite subset Sx of any given orbit. 

Theorem 3.3 (Nilpotent Hales-Jewett). Assume that G acts on the right on a compact 
metric space (X,p) by homeomorphisms. For every system A, every e > and every 
H e & there exists H < N e & and a finite set S c G suc/i that for every x e X there 
exist a non-empty a c N and seS suc/i that p(xsg(a), xs) < e for every g e A 

The fact that this theorem does indeed generalize IIBL03I Theorem 0.19] follows 
from Corollary [236] that substitutes IIBL031 §1 and §2]. We refer to IIBL031 §5.5] 
and [BL99, §3.3] for a discussion of the relation to the classical coloring Hales-Jewett 
theorem. In a nutshell, the coloring Hales-Jewett theorem follows from an analogue 
of Theorem I3.3l for commutative semigroups of continuous maps onX. 

Proof. We use PET-induction on the weight vector w(A). If w(A) vanishes identically 
then A is the empty system and there is nothing to show. Assume that the conclusion is 
known for every system whose weight vector precedes w(A). Let h e A be an element 
of maximal level, without loss of generality we may assume h^l G . Let fc be such that 
every Zc-tuple of elements ofX contains a pair of elements at distance < e/2. 

We define inductively finite sets H i , finite sets B i ,B i c G, systems A, whose weight 
vector precedes w(A), positive numbers e, and finite sets N i as follows. Begin with 
H Q := H and B = B = {1 G }. The weight vector w(Aj of the system 

A, := {bh^gDngb- 1 , mcJV U---UJV,_ 1 ,bGB 1 } 
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precedes w(A) by Proposition P3]2] By uniform continuity we can choose e, such that 

e 

pU,y) < e, => Vb eB, p(xb,yb) < — . 

By the induction hypothesis there exist a finite set H t < N t e & and a finite set S, cG 
such that 

(3.4) Viel 3n, c JV^s,- eS, Vg eA ; pCxs.gCn,),^) < e ; . 
Finally let H i+1 := H, U N t and 

B i+1 := {Sibh^y 1 , a, c JV,,s ; e S,, beBjc G, 

:={bg(m), g e A, m c JV u • • • U JV„ b e B !+1 } c G. 

This completes the inductive definition. Now fix x el. We define a sequence of points 
j ; by descending induction on i. Begin with y fc := x. Assume that y f has been chosen 
and choose n ; c JV ; and s ; e S, as in Q3.4D , then set y £ _j := y ; s, . 

Finally let x := yo 5 o^( n o) _1 an d := Xjh(n i+1 ) _1 . We claim that for every g e A 
and < i < j < fc we have 

(3.5) p(x j g(n i+1 U---Un J ),x i )< ^C/~0- 

This can be seen by ascending induction on j. Let i be fixed, the claim is trivially true 
for j = i. Assume that the claim holds for j — 1 and let g be given. Consider 

b := s=i . . .s fr(n ) _1 . . .h(n,-_;i) _1 e Bj and 

b := bg(n i+1 U---Un J _ 1 )eB r 
By choice of and Si we have 

P(.y,s j frfr(n j r 1 g(n i+1 U • • • U n,-)g(Hi+i u • • • U n,^) -1 ^ 1 ,^) < e ; -. 
By definition of fj this implies 

p(y ; s J bh(n J )- 1 g(n 1+1 U---Un J )g(n 1+1 U---Un ; _ 1 )- 1 b- 1 b,y J s J b) < — . 
Plugging in the definitions we obtain 

Vjt.- , eCn.- U---Un.- ,)) < 

2k 



e 

p(x j g{n i+1 U---Un j ),x j _ 1 g{n i+1 U---Un j _ 1 )) < — . 



The induction hypothesis then yields 
p(.Xjg(n i+l U ■ ■ ■ U rijXxi) 

< p{xjg{n l+l u ■ • • u n^,x ; _ 1 g(n ;+1 u • • • u ^_ x )) + p(x J _ 1 g(n i+1 u • ■ • u n^),*,) 
as required. 

Recall now that by definition of k there exist < i < j < k such that p[x h Xj) < |. 
By J3.5D we have 

p(Xjg(n i+1 U • ■ • U rij), Xj) < p{Xjg{n i+l U • • • U n } ), x t ) + p(x h x } ) < e 

for every g e A. But x ; = xs for some 

s e S := S fc . . .Soh^CJVo))- 1 . . . K&tNJT 1 , 

and we obtain the conclusion with JV = N U ■ • • U N k and S as above. □ 

We remark that ||BL99l Theorem 3.4] provides a slightly different set S that can be 
recovered substituting y k := xh[N k ) and y ; _j := j;S,?i(iVi-i) in the above proof and 
making the corresponding adjustments to the choices of B h b and S. 
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Multidimensional nilpotent Hales-Jewett. Just as in the commutative setting, a mul- 
tidimensional version of the Hales-Jewett theorem can be deduced from the one- 
dimensional version. 

Theorem 3.6 (Multidimensional nilpotent Hales-Jewett). Assume that G acts on the 
right on a compact metric space (X,p) by homeomorphisms and let meN. For every finite 
set A c P o (^0> G.)® m , every e > and every H G there exists a finite set H < N G & 
and a finite set S c G such that for every x G X there exists s G S and non-empty subsets 
a 1 < • • • < a m c N such that p(xsg(a 1 , ... , a m ), xs) < e for every geA 

Proof. We use induction on m. The base case m = is trivial. Assume that the conclu- 
sion is known for some m, we prove it for m + 1. 

Let A c P o (^0> G.)® m+1 and H be given. For convenience we write a = (a 1; . . . , a m ) 
and a = a m+1 . By definition each g g A can be written in the form 

g(a x , a m+1 ) = #2 O)gi(a) 

with gl e P C% G.) 8m and gf e P o (Jf 05 G.). 

We apply the induction hypothesis with the system {g 1; g e A} and e/2, thereby 
obtaining a finite set JV > H and a finite set S c G. We write "a c N" instead of 
u a 1 < ■ ■ ■ < a m c N". 

By uniform continuity there exists e' such that 

p[x,y)<e' =^ V5GS,acJV,geA p(x5g 1 (a),ysg 1 (5)) < e/2. 

We invoke Theorem 13.31 with the system {sg^s -1 ^ eS,acW,g e A} and e', this 
gives us a finite set N' > N and a finite set S' c G with the following property. For 
every x e X there exist s' e S' and aczN' such that 

Vs eS,5 ciV,g eA pCxs'sg" (a)5 _1 ,x5') < e'. 

By choice of e' this implies 

Vs e S, a c JV,g e A p(x5 / sgf(a)g 1 (5),x5 / sg 1 (5)) < e/2. 

By choice of JV and S, considering the point xs', we can find a c N and seS such that 

VgeA p(xs'sg 1 (a),xs's) < e/2. 

Combining the last two inequalities we obtain 

Vg e A p(xs'sg(a, a), xs's) < e. 

This yields the conclusion with finite sets NUN' and S'S. □ 

The combinatorial version is derived using the product space construction of Fursten- 
berg and Weiss HFW78II . 

Corollary 3.7. Let G. be a filtration on a countable nilpotent group G, m G N and 
A c P o C^"0>G.)® m a finite set and I G N >0 . Then there exists N G N and finite sets 
S,T c G such that for every l-coloring of T there exist a x < ■ ■ ■ < a m c N and seS such 
that the set {sg(a), g G A} is monochrome (and in particular contained in T). 

Proof. Let X := l G be the compact metrizable space of all I -colorings of G with the 
right G-action xg(h) = x(gfi). We apply Theorem 13.61 to this space, the system A, set 
H = and a sufficiently small e to ensure that p(x,x') < e implies x(e G ) = x'(e G ). 

This yields certain JV e N and S c G that enjoy the following property: given 
a coloring x g X there exist a x < • - • < a m c JV and s g S such that {sg(a), g g 
A} is monochrome. Observe that this property only involves a finite subset T = 
U geA Sg(^(N)™) c G. ' □ 
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In the proof of our measure-theoretic recurrence result we will apply this combina- 
torial result to polynomial-valued polynomial mappings. We encode all the required 
information in the next corollary. 

Corollary 3.8. Let m e N, K < F < P G% G.) be VIP groups and FE < P C% G S° J be 
a countable subgroup that is closed under substitutions g >-» g[/3] (recall (12.23 D J. 

Then for any finite subsets (i?,)- =0 C K® m n FE, (W^lJ c F® m n FE there exist 
N,w e N and (L^M;)^ c (K"® N n FE) x (F® N n FE) such that /or every l-coloring 
of the latter set there exists a number a and sets fJ 1 < ■ ■ ■ < fl m c JV such that the 
set {L a R i [p],M a Wi c [p]L~ 1 \j ( is monochrome (and in particular contained in the set 
(L t ,Mi)f =l ). We may assume L 1 = 1 G . 

Proof. By Proposition ^. 22l the maps j3 — > (P ; [/3], Wj. [/3]P; [/3]) are polynomial expres- 
sions with values in K®" x F 8w . By the assumption they also take values in FE x FE. 
Given an Z -coloring % or (X®" n FE ) x n FE ) we P ass t0 the Z -coloring x(g,h) = 
Xigyhg" 1 ). Corollary [3T71 then provides the desired JV and (L i ,M i )f =1 = T US. □ 

4. FVIP GROUPS 

In this section we consider a class of IP-polynomials that IP-converge to orthogonal 
projections. 

Definition 4.1. An FVIP group is a finitely generated VIP group. 

The main result about FVIP groups is the following "nilpotent" version of [HFM96, 
Theorem 1.8] and [BHKM06, Theorem 1.9] that will be used to construct "primitive 
extensions" (we will recall the definitions of a primitive extension and an IP-limit in 
due time). 

Theorem 4.2. Let G. be a prefiltration of finite length and F < P (& 9 , G.) be an FVIP 
group. Suppose that G acts on a Hilbert space H by unitary operators and that for each 
(go) a e F the weak limit P g = w-IP-lim ae j? g a exists. Then 

(1) each P g is an orthogonal projection and 

(2) these projections commute pairwise. 

The finite generation assumption cannot be omitted in view of a counterexample in 
[BFM96]. We begin with some algebraic preliminaries. 

Hirsch length. We use Hirsch length of a group as a substitute for the concept of the 
rank of a free Z-module. Recall that a subnormal series in a group is called polycyclic 
if the quotients of consecutive subgroups in this series are cyclic and a group is called 
polycyclic if it admits a polycyclic series. 

Definition 4.3. The Hirsch length h(G) of a polycyclic group G is the number of infinite 
quotients of consecutive subgroups in a polycyclic series of G. 

Recall that Hirsch length is well-defined by the Schreier refinement theorem, see e.g. 
[Rot95, Theorem 5.11]. For a finitely generated nilpotent group G with a filtration G. 
one has 

h(G) = J] rank G;/G i+1 . 

i 

Lemma 4.4. Let G be a finitely generated nilpotent group. Then for every subgroup 
V <Gwe have that h(V) = h(G) if and only if [G : V] < oo. 

Proof. If [G : V] < oo then we can find a finite index subgroup W < V that is normal 
in G and h(G) = h(W) = h(V) follows from the Schreier refinement theorem. 

Let now G. be the lower central series of G. Let V < G be a subgroup with h(V) = 
h(G) and assume in addition that G ; < V for some i = 1, ...,d + 1. We show that 
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[G : V] < oo by induction on i. For i = 1 the claim is trivial and for £ = d + 1 it 
provides the desired equivalence. 

Assume that the claim holds for some i. Let V ; = V n G ; be the filtration on V 
induced by G. and assume V i+1 = G i+1 . By the assumption we have 

d d 

rank G ; /G ;+1 = h(G) = h(V) = ^]rankV,/V, +1 , 

;=1 j=l 

and since V ; /V ;+1 = V ; G ;+1 /G J+1 < Gj/Gj +1 for every j this implies that VjG i+1 < 
Gj/G i+1 is a finite index subgroup. Let K c G, be a finite set such that KVjG i+1 = 
G t /G i+1 . Then KV < G is a subgroup and a finite index extension of V. Moreover 
KV 2 G, and by the first part of the lemma we have h(KV) = h(V). 

By the induction hypothesis KV has finite index in G, so the index of V is also 
finite. □ 

Lemma 4.5. Let G be a finitely generated nilpotent group with a filtration G. of length 
d and letV < G a subgroup. Then for every j = 1, . . .,d + 1 and every geG there exist 
at most finitely many finite index extensions ofV of the form (V,gc) with ceG r 

Proof. We use descending induction on j. The case j = d + 1 is clear, so assume that 
the conclusion is known for j + 1 and consider some g e G. 

Let c a , a = 0, 1 be elements of Gj such that (V,gc a ) are finite index extensions of V. 

Then also (VGj +1 ,gc a ^ /Gj +1 is a finite index extension of VG ;+1 /G J+1 , so that there 
exists an m > such that (gc a G ;+1 ) m e VG ;+1 /G J+1 for a = 0, 1. 

Since the elements c a G ;+1 are central in G/G J+1 this implies (c^ 1 c 1 ) m G J+1 e (VG J+1 n 
Gj)/Gj +1 . But the latter group is a subgroup of the finitely generated abelian group 
Gj/Gj +1 , so that Cg 1 c 1 g K(VG J+1 nG ; ) for some finite set K c G ; that does not depend 
on c ,c x . 

Multiplying c x with an element of V we may assume that c 1 e c JCG, +1 . By the 
induction hypothesis for each g' e gc if there exist at most finitely many finite in- 
dex extensions of the form (V,g'c') with c' e G J+1 , so we have only finitely many 
extensions of the form (VjgCj) as required. □ 

Corollary 4.6. Let G be a finitely generated nilpotent group and V be a subgroup. Then 
there exist at most finitely many finite index extensions ofV oftheform (V,c). 

Proof. Consider any filtration G. and apply Lemma R31 with j = 1 and g = 1 G . □ 

Note that, in general, there does not exist a unique maximal finite index extension 
of a subgroup V < G. Take for example the group G generated by x,y,z with the 
relations 1 G = x 2 = y 2 , xz = zx, yz = zy, [x,y] = z and consider V = {1 G }. Then 
the groups generated by x and by y are extensions of V with index 2, but (x,y) = G 
is infinite. 

The following example shows that Corollary 14.61 may fail for virtually nilpotent 
groups. Consider the semidirect product G = Z 2 K Z that is associated to the inversion 
action n : Z 2 r\ Z given by 7T(d)(b) = (,-lfb. Then G 2 = [G, G] = 2Z is an abelian 
subgroup of index 4 and G i+1 = [G, GJ = 2 ! Z for all i e N, so G is not nilpotent. Let 
V = {0} < G be the trivial subgroup. Since we have (la) 2 = e V for any a e Z, each 
group of the form (v, la"j is an extension of V with index 2. On the other hand, for 
every value of a we obtain a different extension. 

Partition theorems. Let & := ^ \ {0} be the set of non-empty finite subsets of the 
integers. An IP-ring is a subset of & that consists of all finite unions of a given strictly 
increasing chain a < a 1 < . . . of elements of & IIFK851 Definition 1.1]. In particular, 
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& is itself an IP-ring (associated to the chain {0} < {1} < . . . ). Polynomials are gener- 
ally assumed to be defined on even if we manipulate them only on some sub-IP- ring 
of 

Since we will be dealing a lot with assertions about sub-IP-rings we find it conve- 
nient to introduce a shorthand notation. If some statement holds for a certain sub-IP- 
ring C & then we say that it holds without loss of generality (wlog). In this case we 
reuse the symbol & to denote the sub-IP-ring on which the statement holds (in partic- 
ular this IP-ring may change from use to use) and redefine JF fl := & u {0}. With this 
convention the basic Ramsey-type theorem about IP-rings, Hindman's theorem, reads 
as follows. 

Theorem 4.7 (Hindman [Hin74]). Every finite coloring of 3P is wlog monochrome. 

This is not the same as the assertion "wlog every finite coloring of & is mono- 
chrome", since the latter would mean that there exists a sub-IP-ring on which every 
coloring is monochrome. 

Recall also the following multiparameter version of Hindman's theorem. 



Theorem 4.8 (Milliken [Mil75], Taylor [Tay76]). Every finite coloring of is wlog 
monochrome. 

The next lemma is a substitute for [BFM96, Lemma 1.6] in the non-commutative 
case. This is the place where the concept of Hirsch length is utilized. 

Lemma 4.9. Let G be a finitely generated nilpotent group and g: & — > G fee any map. 
Then wlog there exist a natural number I > and a subgroup W < G such that for any 
ctj < • ■ ■ < a; e 3? the elements g a , ... ,g ai generate a finite index subgroup of W. 

Proof. By the Milliken-Taylor theorem RTSl we may wlog assume that for each Z < Zr(G)+ 
1 the Hirsch length ft( {g ai , . . . , g ai y ) does not depend on (a x , . . . , a ( ) e J^, . Call this 
value h ; . It is an increasing function of I that is bounded by fe(G), hence there exists 
an Z such that fr ; = h i+1 . Fix some (a 1; . ..,aj) e ^ and let V := (g ai ,- ■•,g ai /- 

Since h l+1 = hi and by Lemma [4741 we see that (V,g a ) is a finite index extension of 
V for each a > a ( . By Corollary [43] and Hindman's Theorem 14. 71 we may wlog assume 
that each g a lies in one such extension W. By definition of fe ( this implies that wlog 
for every (a 1; . . . , otj) e the Hirsch length of the group (g ai , ■ ■ ■ ,g ai ) — W is h(W). 
Hence each {g ai , . . . , g ai > < W is a finite index subgroup by Lemma [44] □ 

In course of proof of Theorem 14.21 it will be more convenient to use a convention 
for the symmetric derivative that differs from ( 12.18D . namely 

D a gil3) = g{aT 1 D a g((3). 

Clearly a VIP group is also closed under D. We begin with two results that allow to 
pass to a finite index subgroup of a VIP group when necessary 

Lemma 4.10. Let F be a VIP group, W < F be a subgroup and V <W be a finite index 
subgroup. Suppose that g e F is such that the symmetric derivative £> a g e W for all a. 
Then wlog for every a the symmetric derivative D a g coincides with an element of V on 
some sub-IP-ring of the form {{5 e $F : /3 > /3 }. 

Proof. Since V has finite index and by Hindman's theorem 14.71 we can wlog assume 
that D a g e w~ l V for some w e W and all a. Assume that w & V. Let 



h{a) :=wD a g ■ 



w, a = \0 

v„ e V otherwise. 



Let a 2 < • • • < a d be non-empty, by induction on d we see that D ai . ..D a h{a) e V for 
all a ^ and D ad . ..D a h(Q) e Vw^V. 
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On the other hand the map a — > h(a)(/3) is G. +1 -polynomial on [a : anjS = 0} 
for fixed /3. Therefore D ad . . .D a h(0) vanishes at all fi > a d , i.e. w coincides with an 



Lemma 4.11 ([BHKM06, Proposition 1.1]). Let W. be a prefiltration, A c P o 0% W.) 
be finite and V < W be a finite index subgroup. Then wlog for every g e A we have 

Proof. Passing to a subgroup we may assume that V is normal. By Hindman's theo- 
rem |4]7]we may wlog assume that every map gV : ^ — » W/V, g e A, is constant on 
& '. Since each gV is polynomial on and vanishes at it vanishes identically. □ 

IP-limits. Let X be a topological space, m e N and g : ^™ -^Ibea map. We call 
xelan IP-limit of g, in symbols IP-lim,j g a = x, if for every neighborhood U of x 
there exists a such that for all a e a > a , one has g s e L7. 

By the Milliken-Taylor theorem F4.8I and a diagonal argument we may wlog assume 
the existence of an IP-limit (even of countably many IP-limits) if X is a compact metric 
space, see [FK85, Theorem 1.5]. 

If X is a Hilbert space with the weak topology we write w-IP-lim instead of IP-lim 
to stress the topology. From the algebraic point of view it is more natural to consider 
maps defined on in this case the IP-limit is just the IP-limit of the restriction to & . 

Following a tradition, we write arguments of maps defined on & as subscripts in 
this section. We also use the notation and assumptions of Theorem 14.21 

The next lemma follows from the equivalence of the weak and the strong topology 
on the unit sphere of H and is stated for convenience. 

Lemma 4.12. Assume thatf e fixP g , i.e. that w-IP-lim a g a f = f. Then also IP-lim a g a f = 
/ (in norm). 

For any subgroup V < F we write P v for the orthogonal projection onto the space 
n geV fixP g . 

Lemma 4.13. Assume that V = (gi, ■ ■ ■ , g s ) is a finitely generated group and that 
P g , . . . , P gs are commuting projections. Then P v = 0;=i Pg t ■ 

Proof. Clearly we have P v < YYi=i Pg > so we onr y nee d fo prove that each / that is 
fixed by P , . . . , P gs is also fixed by P g for any other geV. 

For this end it suffices to show that if / is fixed by P g and P h for some g,/ieV then 
it is also fixed by P gh -i . Lemma [4.12| shows that IP-lim a g a f = f and IP-lim a h a f = f . 
Since each h a is unitary we obtain lP-\im a h~ 1 f = f . Since each g a is isometric this 
implies 



The next lemma is the main tool to ensure IP-convergence to zero. 

Lemma 4.14 ([BFM96, Lemma 1.7]). Let (P a ) ae j? be a family of commuting orthogonal 
projections on a Hilbert space H and f e H. Suppose that, whenever a 1 < ■ ■ ■ < a h one 
has nL P a J = 0. Then IP-lim a ||P a /|| = 0. 

Finally we also need a van der Corput-type estimate. 

Lemma 4.15 ([FK85, Lemma 5.3]). Let (x a ) aejr be a bounded family in a Hilbert space 
H. Suppose that 



element of V on {f5 : f5 > a d }. 



□ 



w-IP-lim gji^f = IP-lim gji^f =f 



as required. 



□ 




Then wlog we have 



w-IP-lim x a = 0. 



a 
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Proof of Theorem \4~2\ We proceed by induction on the length of the prefiltration G.. If 
G. is trivial there is nothing to prove. Assume that the conclusion is known for G. +1 . 

We start by proving the first assertion. Let g e F. Since P g is clearly contractive it 
suffices to show that it is a projection. 

By Lemma |4T9l we may assume that, for some Z > and any a 1 < ■■■ < a h the 
derivatives D ai g, . . . ,D a[ g generate a finite index subgroup of some W < F 1 (recall 
that F 1 = F n P C% G. +1 )). We split 

(4.16) H = n v < vv kerP v ,elin(u v < w imP v ) =:H \JH J} 

where V runs over finite index subgroups of W. It suffices to show P g f = P^f for each 
/ in one of these subspaces. 

Case 0. Let / e H and a x < ■■• < ot;. By the choice of W we know that V := 
(b ai g, . ..,D a[ gJ < W is a finite index subgroup. Since the projections P^ g commute 
by the inductive hypothesis, their product equals P v (Lemma [4.13D and we have P v f = 
by the assumption. 

By Lemma 14. 141 this implies IP-lim a \\P(, g f\\ =0. Therefore 



IP-lim 

a 

so that 



IP-lim U&agW.g-Y. 



< IP-lim || w-mim(D ag yf || = IP-lim \\P 6 f || = 0, 



IP-lim IP-lim (g a 



By Lemma [4. 15| this implies P f = (initially only wlog, but the limit is supposed to 
exist on the original IP- ring). 

Case 1. Let V < W and / = P v f , by linearity we may assume ||/|| = 1. Let p be a 
metric for the weak topology on the unit ball of H with p(x,y) < \\x — y\\. Let e > 0. 
By definition of IP-convergence and uniform continuity of P g there exists a such that 

Va > a p(gJ,P g f) < e and p{P g gJ,P 2 g f) < e. 

By Lemma l4.10| we can choose a> a such that D a g coincides with an element of V 
on some sub-IP- ring and in particular Pf> g f = / . By Lemma [4. 121 there exists /3 > a 
such that 

V/3>/3 Kb a g) p f -f\\<e. 
Applying gpg a to the difference on the left-hand side we obtain 

WSaupf ~ gpgaf II < e, so that p{g aup f, gpgj) < e. 
Observe that a U /3 > a , so that 

p(P g f,gpgJ)<2e. 
Taking IP-limit along f5 we obtain 

p{P g f,P g gJ)<2e. 
A further application of the triangle inequality gives 

p(V> p g 2 /)<3e, 

and since e > was arbitrary we obtain P f = P 2 f . 
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Commutativity of projections. Let us now prove the second assertion, namely that P g 
and P g ' commute for any g,g' e F. Observe that the function a —* g a can be seen as 
a polynomial-valued function in P o (^0,P(.^0, G.)) whose values are constant polyno- 
mials. Moreover we can consider the constant function in P(^0,P.) whose value is g' . 
Taking their commutator we see that 

a [*«,*'] eP o (Jf 0) P(jr ,G.)), 

and since F is a VIP group this map in fact lies in P Q {&®,F.)- By J2.17D it takes 
values in F 1 . By Lemma [T9l we may assume that for any a x < ■■■ < oc ( the maps 
[g a >%']>•■•> Lga,>g'] generate a finite index subgroup of some W < F 1 . Interchang- 
ing g and g' and repeating this argument we may also wlog assume that for any 
a 1 < ■ ■ ■ < ay the maps [g^, g], . ■ ■ , [g' a , g] generate a finite index subgroup of some 
W' <F 1 . Consider the splitting 
(4.17) _ 
H = (n y < w kerP y nrv< w /kerP v ,) ©tin (u v < w imP v U \J v , sw >imP v >) =:H UH 1 . 

Case 0. Let / e H . As above we have IP-lim a l|P[ ga , g ']/ II = 0, and in particular 
= IP-lim ^w-IP-lim [g a , g' p ]f, g~ l P gl f J 

= IP-limlP-lim (g a g' p f,g' p P gl f) = IP-lim {g a P g> f,P gl f) , 

since IP-lim^ g'pP g >f = P g >f by Lemma 14.121 Hence P g P g 'f -L P g 'f, which implies 
P g P g 'f = since P g is an orthogonal projection. 

Interchanging the roles of g and g' we also obtain P g >P g f = 0. 
Case 1. Let V < W and / = P v f . By Lemma |4.11| we may wlog assume that [g a ,g'] e 
V for all a. Let a be arbitrary, by Lemma [4.12| the limit 

IP-lim[g a ,g^]/=/ 

also exists in norm. Therefore 

g a P g 'f = w-IP-lim g a g' p f = w-IP-lim g p g a [g a ,g p ]f = w-IP-lim g' p gj = P g ,gJ. 

Taking IP-limits on both sides we obtain 

p p ,f = p ,p f 

g g J g gJ 

The case V' < W' and / = P v *f can be handled in the same way. □ 

If the group G acts by measure-preserving transformations then the Hilbert space 
projections identified in Theorem |4.2| are in fact conditional expectations as the fol- 
lowing folklore lemma shows. 

Lemma 4.18. Let X be a probability space and (T a ) a be a net of operators on L 2 (X) 
induced by measure-preserving transformations. Assume that T a — » P weakly for some 
projection P. Then P is a conditional expectation. 

Proof. Note that imP n L°°(X) is dense in imP. 

Let /, g e imP n L°°(X). Since the weak and the norm topology coincide on the unit 
sphere of L 2 {X) we have \\T a f — f\\ 2 — * and ||T a g — g|[ 2 — * 0. Therefore 

\\P(fg) ~ fgh ^ limsu P \\T a ifg)-fg\\ 2 = Hmsup \\{TJ -f)T a g+f{T a g - g)|| 2 

a a 

< limsup \\TJ -/|| 2 ||r a g|L + \\f\U\T a g - g|| 2 = 0. 

a 

This shows that imPn L°°(X) is an algebra, and the assertion follows. □ 
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Examples of FVIP groups. In order to obtain some tangible combinatorial applica- 
tions of our results we will also also need non-trivial examples of FVIP groups. It is 
then possible to combine them in an arbitrary fashion as the next lemma shows. 

Lemma 4.19. Let F, F' < P (<% G.) be FVIP groups. Then the group F V F' is also FVIP. 

Proof. The group F V F' is clearly finitely generated and invariant under conjugation 
by constants. Closedeness under D follows from the identity 

(4.20) D m {gK) = h- l i) m gg{m)hbjig{mT l . □ 

Our main example (that also leads to Theorem ll.lD are maps induced by polynomial 
sequences in finitely generated nilpotent groups. In this case the abstract notion of a 
polynomial admits an equivalent formulation in more familiar terms. 

Lemma 4.21 ( [[Lei02l Proposition 3.9]). Let G he a finitely generated torsion-free nilpo- 
tent group with a filtration G. and a basis T 1 , . . . , T t . Then a map g : Z — » G is poly- 
nomial if and only if it has the form g(a) = T^ 1 ^ . . . T^'^ where each p, : Z — > Z is a 
conventional polynomial of degree < j (where j is determined by T t e G ; \ G J+1 J. 

It remains to compose a polynomial sequence with an IP-sequence to obtain an 
FVIP-sequence. 

Lemma 4.22. Let G be a finitely generated nilpotent group with a filtration G.. Let 
( n a)ae5? B be an IP-sequence in Z k (i.e. n aU p = n a + np whenever a, j5 e are disjoint). 
Then the maps of the form 

a e JP (N) --» g pM , where g e G„p e Z[X],p{0) = 0,degp < i 

generate an FVIP subgroup F < P {^, G.). 

Proof. It is clear that the maps of the above form are indeed polynomial and their 
symmetric derivatives have the same form. The generated subgroup F < P o (^0,G.) 
is clearly closed under conjugation by constants and is closed under D by H4.20D . It 
remains to show that F is finitely generated. 

For this end assume that G is torsion-free and let T x ,...,T t be an ordered basis of 
G. By Lemma [4.21| F is generated by maps of the form 

a e ^j(N) —> Tj , where Tj G G,, q monomial with degq < i. 

Passing to a covering we may remove the assumption that G is torsion-free. □ 

5. Measure recurrence 

In this section we prove a "nilpotent" generalization (Theorem l5.26l) of the polyno- 
mial IP multiple recurrence theorem due to Bergelson and McCutcheon [ BM00 , Theo- 
rem 1.3] . We generally follow their proof and import many intermediate results whose 
proofs do not rely on commutativity. We write out only the parts of the argument that 
are different in the non-commutative setting. 

We now fix a nilpotent group G with a filtration G. and an FVIP group F < P (^"b> G.). 

Whenever we talk about measure spaces [X, j4 , (Y,SS,v) or [Z, ', y) we sup- 
pose that they are regular and that G acts on them on the right by measure-preserving 
transformations. This induces a left action on the corresponding L 2 spaces. Recall 
that to every factor map (Z, c £, y) —> (Y, 9&, v) there is associated an essentially unique 
measure disintegration 




Tydviy), 



see IIFur811 §5.4]. We write || • \\ y for the norm on L 2 {Z,j y ~). Recall also that the fiber 
product Z x y Z is the space Z x Z with the measure J v J y ® T y dv(y). 
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Extensions. Following the general scheme of Furstenberg's proof, the multiple recur- 
rence theorem is obtained by (in general transfinite) induction on a suitable chain of 
factors. In the induction step we go from a factor to a "primitive extension" that en- 
joys a dichotomy: each element of the given FVIP group acts on it either relatively 
compactly or relatively mixingly. We review the steps in the construction of such an 
extension, referring to [BMOO] for most proofs. 

Compact extensions. We begin with the appropriate notion of relative compactness. 
Heuristically, an extension is relatively compact if it is generated by the image of a 
relatively Hilbert-Schmidt operator. 

Definition 5.1 f llBMOOl Definition 3.4]). Let Z -» Y be a factor. A Z\Y-kernel is a 
function H G L°°(Z x y Z) such that 



H( Zl ,z 2 )d rz2 ( Zl ) = 
for a.e. z 2 g Z. If H is a Z|7-kernel and cp G L 2 (Z) then 



H(z 1 ,z 2 )0(z 2 )dr (z 2 ). 



The map cp —* H* cp is a Hilbert-Schmidt operator on almost every fiber over Y with 
uniformly bounded Hilbert-Schmidt norm. These operators are self-adjoint provided 
that H(z 1; z 2 ) = H(z 2 ,z 2 ) a.e. 

Definition 5.2 f llBMOOl Definition 3.6]). Suppose that X -» Z -» Y is a chain of 
factors, K < F is a subgroup and H is a non-trivial self-adjoint X\Y -kernel that is K- 
invariant in the sense that 

IP-lim g(a)H =H 

a 

for every g &K. The extension Z — > Y is called K -compact if it is generated by functions 
of the form H*cp,cp<E L 2 {X). 

Lemma 5.3 ([BMOO, Remark 3.7(i)]). Let X -* Z -* Y be a chain of factors in which 
Z —* Y is a K-compact extension generated by a X\Y -kernel H. Then H is in fact a 
Z\Y -kernel and Z is generated by functions of the form H *<p, (p e L 2 (Z). 

Proof. Call the projection maps n : X — > Z, 6 : X — > Y. Let e L 2 {X). Since H * 4> is 
Z-measurable we have 

H*<t>(x)= H*0(x 1 )dju JtW (x 1 ) 

H(x l ,x 2 )<t>(w 2 )dy l e(x l )(x 2 )diJ, 7tM (x 1 ) 
H(x 1 ,x 2 )(/)(x 2 )d j u eM (x 2 )dM 7l(x) (x 1 ) 

H(x 1 ,x 2 )d/i 7t(;c) (x 1 )^(x 2 )d/i 0(;c) (x 2 ) 

= E(H\Z x F X)*0(w). 

Since this holds for all (p we obtain H = E(H|Z x y X). Since H is self-adjoint this 
implies that H is Z x y Z-measurable. This in turn implies that H * cp = H* ¥,{cp |Z) for 
all cp e L 2 (X). □ 

In view of this lemma the reference to the ambient space X is not necessary in the 
definition of a K-compact extension. Just like in the commutative case, compactness 
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is preserved upon taking fiber products (this is only used in the part of the proof of 
Theorem 15 . 1 8 1 that we do not write out). 

Lemma 5.4 ([BMOO, Remark 3.7(h)]). Let Z -» 7 be a K-compact extension. Then 
Z x y Z — » 7 is also a K-compact extension. 

Primitive extensions. Now we define what we mean by relative mixing and the di- 
chotomy between relative compactness and relative mixing. 

Definition 5.5 ( IIBM001 Definition 3.5]). Let Z -» Y be an extension. A map g e F is 
called mixing on Z relatively to Y if for every H e L 2 (Z x y Z) with E(H|7) = one has 
w-IP-lim a g(a)H = 0. An extension Z — » 7 is called K-primitive if it is K-compact and 
each g e F \ K is mixing on Z relative to 7. 

The next lemma is also used in the suppressed part of the proof of Theorem l5.18l 

Lemma 5.6 ([BMOO, Proposition 3.8]). Let Z — > 7 be a K-primitive extension. Then 
Z x y Z — » 7 is also a K-primitive extension. 

Like in the commutative setting [McC05, Lemma 2.8] the compact part of a primi- 
tive extension is wlog closed under taking derivatives, but there is also a new aspect, 
namely that it is also closed under conjugation by constants. 

Lemma 5.7. Let Z — > 7 be a K-primitive extension. Then K is closed under conjugation 
by constant functions. Moreover wlog K is an FVIP group. 

Proof. Let g e F \K, h e G and H e L 2 (Z x F Z) be such that E(H|7) = 0. Then 
w-IP-lim(h _1 gh)(a)H = h" 1 w-IP-lim g(a)(hH) = 0, 

a a 

so that F \ K is closed under conjugation by constant functions, so that K is also closed 
under conjugation by constant functions. 

Since F is Noetherian, the subgroup K is finitely generated as a semigroup. For a 
generator g eif by Hindman's Theorem 14.71 we may wlog assume that either D a g e K 
for all a e ^ or D a g ^ K for all a e j£\ In the latter case we obtain 

= w-IP-lim D«g(a)H = IP-limg(a) _1 g(a u ^)g(^) _1 H = H, 

a contradiction. Thus we may assume that all derivatives of the generators lie in K. 
This extends to the whole group K by ( I4.20D and invariance of K under conjugation 
by constants. □ 

Existence of primitive extensions. Since our proof proceeds by induction over primitive 
extensions we need to know that such extensions exist. First we need a tool to locate 
non- trivial kernels. 

Lemma 5.8 ([BMOO, Lemma 3.12]). LetX -> 7 be an extension. Suppose that O^fle 
L 2 {X x y X) satisfies E(ff|7) = and that there exists g eF such that IP-lim a g(a)H = H. 

Then there exists a non-trivial self-adjoint non-negative definite X\Y -kernel H' such 
that IP-lim a g{a)H' = H'. 

Secondly we have to make sure that we cannot accidentally trivialize them. 

Lemma 5.9 ([BMOO, Lemma 3.14]). Let Z — > 7 be a K-compact extension. Suppose 
that for some g £jf and self-adjoint non-negative definite Z\Y -kernel H we have 



IP-lim 

a 

for all f e L°°(Z). Then H = 0. 



(g(a)H)(/' ®/')df = 
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The next theorem that provides existence of primitive extensions can now be proved 
in the same way as in the commutative case [BMOO, Theorem 3.15]. The only change is 
that Theorem |T2] is used instead of [BMOO, Theorem 2.17] (note that F is Noetherian, 
being a finitely generated nilpotent group). 

Theorem 5.10. LetX —>Y be a proper factor. Then there exists a subgroup K < F and a 
factor X —* Z — » Y such that the extension Z —>Y is proper and wlog K-primitive. 

Almost periodic functions. For the rest of Section [5] we fix a good group FE < 
P o (j^0, G.)®". We will describe what we mean by "good" in Definition 15.231 for the 
moment it suffices to say that FE is countable. 

Definition 5.11 f llBMOOl Definition 3.1]). Suppose that (Z, <g,y) -» (Y,S8,v) is a 
factor and K < F a subgroup. A function / e L 2 {Z) is called K -almost periodic if for 
every e > there exist g\,...,g\ e L 2 (Z) and D e 33 with v(D) < e such that for every 
5 > and R e K®" n FE there exists a Q such that for every a Q < d e " there exists a 
set E = E(a) e 58 with v(£) < 5 such that for all y e Y \ (D U E) there exists 1 < j < I 
such that 

||R(a)/-g J || y <e. 
The set of K-almost periodic functions is denoted by AP(Z, Y, K). 

The next lemma says that a characteristic function that can be approximated by 
almost periodic functions can be replaced by an almost periodic function right away. 

Lemma 5.12 f llBMOOl Theorem 3.3]). Let Ae? be such that 1 A e AP(Z,Y,K) and 
5 > 0. Then there exists a set A' czA such that y{A \ A') < 5 and 1 A > e AP(Z, Y, K). 

In the following lemma we have to restrict ourselves to K®" n FE since K®" need 
not be countable. 

Lemma 5.13 ([BMOO, Proposition 3.9]). LetX —>Y bean extension, K < F a subgroup 
and H a X\Y -kernel that is K-invariant. Then wlog for all R e K®" n FE and e > there 
exists a such that for all a < awe have 

\\R(a)H-H\\ < e. 

With help of the above lemma we can show that in fact wlog every characteristic 
function can be approximated by almost periodic functions. In view of Lemma 15.121 
this allows us to reduce the question of multiple recurrence in a primitive extension to 
multiple recurrence for (relatively) almost periodic functions. 

Lemma 5.14 ([BMOO, Theorem 3.11]). Let Z -» Y be a K-compact extension. Then 
wlog AP(Z,Y,K) is dense in L 2 {Z\ 

Multiple mixing. Under sufficiently strong relative mixing assumptions the limit be- 
havior of a multicorrelation sequence \\ { S i {a)f i on a primitive extension only depends 
on the expectations of the functions on the base space. The appropriate conditions on 
the set {Sj; are as follows. 

Definition 5.15. Let K < F be a subgroup. We call a subset A c F K-mixing if 1 G e A 
and g _1 fte F \ K whenever g^h&A. 

The requirement 1 G e A is not essential but is convenient in inductive arguments. 
In order to apply PET induction we will need the next lemma. 

We say that a subgroup K < F is invariant under equality of tails if whenever SeJ( 
and T e F are such that there exists /3 e & with S a = T a for all a > fi we have 
T e K. Every group K < F that is the compact part of some primitive extension has 
this property. 



NILPOTENT IP POLYNOMIAL MULTIPLE RECURRENCE 



21 



Lemma 5.16. Let K < F be a subgroup that is invariant under equality of tails. Let 
S,T eF be such that S~ J T Thenwlog 

{Sb p SY l (TL)pT)<£K and S~\TD p T) 

for all p e % 

Proof. If the first conclusion fails then by Hindman's theorem [4?7] wlog 

h{a) := (S6 a S) _1 (rD a r) e K for all a e & 

and h(0) ^ K. Proceed as in the proof of Lemma |4.10| Analogously for the second 
conclusion. □ 

The next lemma is a manifestation of the principle that compact orbits can be 
thought of as being constant. 

Lemma 5.17 ([BMOO, Proposition 4.2]). LetZ^Ybea K-primitive extension, R 13 <eK 
for each p&^andW&F\K.Let also f,f £ L°°(Z) be such that either E(/|Y) = or 
E(/'|Y) = 0. Thenwlog 

IP-lim||E(R^(a)/W(jS)/'|y)|| =0. 
We come to the central result on multiple mixing. 



Theorem 5.18 (cf. [BMOO, Theorem 4.10]). Let K < F be a subgroup. For every K- 
mixing set {S = 1g,S 1; ... ,S t ] cF the following statements hold. 

(1) For every K-primitive extension Z —>Y and any fo,...,f t e L°°(Z) we have wlog 

t t 
w-IP-lim^S^a)/, - ]~[s ! (a)E(/ I |y) = 0. 

i = l i = l 

(2) For every K-primitive extension Z — > Y and any fo,...,f t e L°°(Z) we have wlog 

t t 

IP-lim E(P]Si(o)/ t |y) -f]s t Co)ECfi|y) =0. 

a i=0 i=0 

(3) For every K-primitive extension Z — » Y , any U i j e K and any /y e L°°(Z) we 
have wZog 

t s t s 



IP-lim kf]Si(«)(ll ^uC^/u) 1 7 ) - ]~[ S > (a)E ( 11 u uCa)/u | Y) 



j=Q i=0 j=0 



: 0. 



We point out that the main induction loop is on the mixing set. It is essential that, 
given K < F, all statements are proved simultaneously for all K-compact extension 
since the step from weak convergence to strong convergence involves a fiber product 
via Lemma [531 



Proof. The proof is by PET-induction on the mixing set. We only prove that the last 
statement for mixing sets with lower weight vector implies the first, the proofs of other 
implications are the same as in the commutative case. 

By the telescope identity it suffices to consider the case E(/ ; | Y) = for some i . By 
the van der Corput Lemma |4.15| it suffices to show that wlog 

r t t 



IP-lim 

I3,a 



This limit can be written as 

r t 



IP-lim 



n s ^ a xf.n s < (au «i; =o - 

i=l i = l 

t 

nSiWfiYlSiiatfpSiiaXSiWfil 



z 1=1 i=l 
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By Lemma 15.161 we may wlog assume that T. } Tj^ and S t 1 Tj p are mixing for all 
fi e ^ provided that i ^ j. Re-indexing if necessary and using Hindman's theorem RT71 
we may wlog assume Sr 1 ^ « e K for all [3 e ^ and i < w and Sr 1 ^ « ^ if for all 
/3 e and i > w for some w = 0, . . . , t. Thus 

(5.19) S ,Si, . . . ,S t , T w+ ip, . . . , T t p 

is a -mixing set for every /3 # 0. Moreover it has the same weight vector as {S^ . . . ,S t ] 
since T t « ~ S ; . Assume that S ; , j 7^ 0, has the maximal level in J5.19D . We have to 
show 



IP-lim 



f] Sr 1 (a)S i (a)(/ i 6 /3 S i (a)(S i (^)/ i )) 

1=1 

t 

• P[ Sr 1 (a)S i (a)/iSr 1 (a)r j!jS (a)(S i (iS)/ i ) = 5 



i=w+\ 

and for each fixed e the limit along a comes from the iC-mixing set 

Sj 1 S 1 ,...,Sj 1 S t ,S- 1 T w+ i t ji, ■ ■ ■ ,Sj T t p 

that has lower weight vector. Hence we can apply the induction hypothesis, thereby 
obtaining that the limit equals 



f w 

IP-lim 



z i=l 



]^[sr 1 (a)S i (a)ECfi^S i (a)(S i (^)/i)|y) 
t 

■ Y\ S- 1 (a)S I (aM/,l^)S- 1 (a)T, ji3 (a)E(S I (/3)/ I |y) 

i=w+l 

This clearly vanishes if i > w, otherwise use Lemma [5. 171 □ 

Multivariable polynomial mixing. In fact we need some information about relative 
polynomial mixing in several variables. First we need to say what we understand under 
a mixing system of polynomial expressions. Recall that by definition each S e F® m can 
be written in the form 

(5.20) S(Oi, . .., ctj = W^-- a ^\a m ) . . . W a ^a 2 )W{ ai ). 

Definition 5.21. Let K < F be a subgroup and m e N. We call a set {S;}[ =0 c F® m 
K-mixing if S = 1 G , the polynomial expressions {SJ are pairwise distinct and for all r 
and i / ;' we have either V5 e W, 3 = W 3 or Va e (W/*) -1 wf & K. 

Note that this definition requires more than {W™},- being (up to multiplicity) a K- 
mixing set. However this definition still agrees with the previous definition of a K- 
mixing subset of F if m = 1. 

Theorem 5.22 (cf. IIBM001 Theorem 4.12]). Let Z —> Y be a K-primitive extension. 
Then for every m > 1, every K-mixing set {S , . . . ,S t } c F® m and any f , . . . ,f t e L°°(Z) 
we have wlog 



IP-lim 

a,,...,a m 



E 



(Y\s i {a 1 ,...,a m )f l \Y)-Y\s i {a 1 ,...,a m Mf l \Y) 



= 0. 



Proof. We use induction on m. The case m = is trivial since the product then consists 
only of one term. Assume that the conclusion holds for m and consider a K-mixing set 
of polynomial expressions inm + 1 variables. For brevity we write a = (a 1; . . . , a m ) 
and a = a m+1 . We may assume that WfW^ < 1 for all i and E(/ ; \Y) = for some £ . 
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By Defintion l5.21l and using notation from (15.201 ) for every a there exists a K-mixing 
set {Vj} c F such that W ; a = V*, where the assignment i — » j" ; does not depend on a. 
Let also 

A, = {S(-) = S £ (-,0) :;,=./}. 

In view of the Milliken-Taylor theorem 14.81 and by a diagonal argument it suffices to 
show that for every 5 > there exist a < a such that 



|E(f[v/(a)(f[s(a)/ Sj )|7) 



SeA, 



< 5 



provided that E(/ s j \Y) = for some j ,S Q . By the induction hypothesis there exists 
a such that 



E 



iY\ S ^fs, k \Y) 



< 5 



since A, is a K-mixing set. This implies 



nv/(a)E(f[s(a)/ Sj |7) 



SeA, 



< 5 



for all a > a. Since {V"}j is a K-mixing set Theorem 15 . 1 8 1 implies 



IB-Urn ||E(f3 v/(o) ]~[ S{a)f SJ \Y) 



< 5. 



□ 



SZ property. We are nearing our main result, a multiple recurrence theorem for poly- 
nomial expressions. In order to guarantee the existence of the limits that we will 
encounter during its proof we have to restrict ourselves to a certain good subgroup of 
the group of polynomial expressions. It will be shown later that this restriction can be 
removed, cf. Corollary 15 .3 II 

Definition 5.23. We call a group FE < P (<% G.)®" good if it has the following prop- 
erties. 

(1) (cardinality) FE is countable, 

(2) (substitution) if m e N, g e P G% G.)® m n FE and e (J?„)< then g[/3] e FE 
and 

(3) (decomposition) if K < F is a subgroup invariant under conjugation by con- 
stants and {SJ[ =0 c F® m n FE is a finite set with S = 1 G then we have finite 
sets {T k } v k zl c F 8m n FE and {RiY i=0 c K® m n FE with R = T = 1 G such that 
S ; = T fc .R ; and for every sub-IP-ring c the set {T fc } is wlog JC-mixing. 

The property of being good is hereditary in the sense that a group that is good with 
respect to some IP-ring is also good with respect to any sub-IP-ring. 

Let (X,j4,\i) be a regular measure space with a right action of G by measure- 
preserving transformations. Let also FE < F®" be a good group. By Hindman's theo- 
rem |4.7l we may wlog assume that 

w-IP-limg(a)/ 

a 

exists for every geF and / e I 2 (X) and by the Milliken-Taylor theorem [4.8| we may 
wlog assume that the limit 



IP-limiifn^aASAaT 1 ) 



exists for every m e N, every j4 and every finite set {S , . . . , S t } c F® m n FE. 
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Definition 5.24. A factor {X, j4 — » {Y, SS,v) is said to have the SZ (Szemeredi) 
property if for every B e ^ with v(B) > and every set of polynomial expressions 
{S = l G ,S 1 ,...,S t Jc F® m n FE one has 

IP-linijU fnL nBSAS)" 1 ) > a(B,m,{Sih) > 0. 

Note that unlike in [BM00, Definition 5.1] the constant depends not only on the 
number of polynomial expressions. We cannot obtain more uniform results due to the 
lack of control on the number w provided by Corollary |3.81 

Lemma 5.25 ([BM00, Proposition 5.2]). For every separable regular measure-preserving 
system X there exists a maximal factor that has the SZ property. 

Theorem 5.26. The identity factor X — > X has the SZ property. 

Proof. By Lemma 15.251 there exists a maximal factor X — » Y with the SZ property. 
Assume thatX ^ Y, then by Theorem 15.101 wlog there exists a subgroup K < F and 
a factor X —* Z such that (Z, X) —* (Y, 9S, v) is a proper K-primitive extension. We 
will show that Z also has the SZ property, thereby contradicting maximality of Y. 

Let Ae 1 with X(A) > and {S;}- =0 c F® m n FE be a finite set with S = 1 G . We 
have to show 

(5.27) IP-limju fn'AS.Ca)" 1 ) > 0. 

By Lemma [5T71 we may wlog assume that K is an FVIP group and by Lemma [5.14| that 
AP is dense in L 2 (Z). Note that FE is still good with respect to the new IP-ring implied 
in the "wlog" notation. Thus wlog we have a K -mixing set {W k } v k zl and polynomial 
expressions R t e K® m n FE with R = 1 G such that S t = W fc .R,-. 

By Lemma [5.12l we may replace A by a subset that has at least one half of its measure 
such that l A e AP. There exist c = c(A(A)) > and a set B e SB such that v[B) > c and 
X y {A) > c for every yeB. Pick < e < min(c/2, c v /(4(t + 1))). 

By Corollary |3 .81 there exist JV,weN and 

{L h M^ =l c {K m n FE) x (F m n FE) 

such that for every Z-coloring of {L ; ,M ; } there exists a number a and sets j3 1 < ■■■ < 
/3 m czN such that the set {L a R 1 [/3],M a W t [^]L- 1 } 

<i<t,o<k<v i s monochrome (and in 

particular contained in the set {L^M,-}). 

Since / = 1 A e AP there exist functions g\,...,gi e i 2 (Z) and a set D e ^ such 
that v(D) < e and for every 5 > and T e i<^® N n FE there exists a such that for 
every a < a e there exists a set £ = £(a) e ^ with v(£) < 5 such that for every 
je(DU £) c there exists j such that ||T(a)/ - gj\\ y < e. Let B' = B n D c , so that 
v(B')>c/2. 

Let Q = | (iV)™ | be the number of possible choices of $ e Jf(JV)™ and 

fll := a(B',JV,{l G } U {M^.t^]}^,^^^,^) > 0. 

Using this with 5 = aj/2w 2 and T = L 1} . . . , L w we obtain wlog for every ct e 
a set E = £(a) e ^ with v(£) < a 1 /2w such that for every ye(DU £) c and every 
i = 1, . . . , w there exists j = }{y, i) such that 

(5.28) - gylly < e for every 1 < i < w. 

By Theorem l5.22l we may also wlog assume that for every a e we have 

(5.29) \\E{Y\w k {a)f\Y) - Y\w k {aMf\Y)\\ < c v {aj2wQ^ 2 /4. 

k<v k<v 
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Recall that we have to show J5.27D - For this end it suffices to find a(A,m, {Sj}o<;<t) 
such that for an arbitrary sub-IP-ring there exists f e &™ with 

H (n^oAS.Cf)" 1 ) > a{A, m, {S;} <;< t ) > 0, 

so fix a sub-IP- ring & . By definition of a x there exists a tuple a e (that will remain 
fixed) such that 

v (C ) > a lt where C := n i=lj . iWk<v j;B'W k [fc{a)- 1 M i {.&T 1 . 

Let 

C ■= C \ U^ECeOMjCeO -1 , 

so that v(C) > aj/2. For every y e C consider an Z-coloring of {L i ,M i } i given by 
i e [l,w] >-» j'(yM;(a),i) determined by (I5.28D . By the assumptions on {L^Mj there 
exist a e [1) w l an d /Si < • • • < P m c iV such that 

||L a (5)R,[#](a)/ - g J Cy)Hy^CeOwKfti.ca)- 1 < e for ever y 0<i<t,0<fc<v. 
This can also be written as 

l|W fc [^](a)R i [#](5)/-W fc [^](a)L a (ar 1 ^ (y) || yMa(3) < e for every < i < t,0 < k < v. 

Since this holds for every i, k and we have R = 1 G this implies 

ll(W fc H i )[#](3)/-W fc [^](5)/|| yMa(a) <2e. 

Passing to a subset C'cC with measure at least a 1 /2wQ we may assume that a and /3 
do not depend on y. Thus we obtain a set B" := C'M a {a) of measure at least a 1 /2wQ 
and a tuple {jj = u iej3- a i)^i suc h tnat 

IIWfcRiff)/ -W fc (f)/ || y <2e 

for every y e B", i and fc. Recall that / is {0, l}-valued, so that 

t t t 

|| Y\ s i(r)f -T\ w «Wf\\ y = II ll w kMr)f -Yl w ^f\\y < 2(t + 1)e 

i=0 fc<v i=0 i=0 

for all y e B". Moreover, since B" c n^'W^f) -1 , one has 

in^OT/l y Xy)|>c v 

for every y e B". Therefore and by J5.29D we obtain 

lllW^IIMlW^IU") 

1=0 1=0 



> || Y\ W ^f)f || w - 2(t + l)ev(B") 1 

k<v 

> ||E(f]w fc (f)/|Y)|| i2(B , - 2(t + l)ev(B") 1 

| Y\ w k (fMf\Y)\\ LKBn - ||E(f[ witCfO/ 1 y) - n ^(f)W|y: 



11/2 

"kVUJ I* V|| L J CB //) -V- i 

> 

fc<v fe<v fe<v 

-2(t + l)ev(B") 1/2 
> c v (a 1 /2wQ) 1 / 2 /4=: a(A m, {SJ ; ) 1/2 - □ 
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Good groups of polynomial expressions. As we have already mentioned, good groups 
are just technical vehicles. The point is that we can perform all operations we are inter- 
ested in within a countable set of polynomial expressions, so that we can wlog assume 
the existence of all IP-limits that we encounter. 

The only non-trivial property of good groups is the decomposition property. How- 
ever, the following lemma essentially shows that it is always satisfied. 

Proposition 5.30. Let K < F be a subgroup that is invariant under conjugation by 
constants, m e N and {S;}[ =0 C F 8 ™ be any finite set with S = 1 G . Then there exists 
a set {T k }l_ Q c P" 8 ™ that is wlog K-mixing and decompositions S, = T k R t such that 
R t e K® m . 

Proof. We argue by induction on m. The claim is trivial for m = 0. Assume that it holds 
for m, we show its validity for m+1. For brevity we write d = (a 1; . . . , a m ), a = a m+1 . 

Consider the maps S ; (a) = S ; (a, 0). By the induction hypothesis there exists a 
set {T k }l =Q c F® m that is wlog K-mixing and decompositions S ; = f k Ri such that 
A, e K® m . Then S ; (5,a) = W? ^(a)f fc( (5)^(5). 

Let i < j. By the Milliken-Taylor Theorem 14.81 we may wlog assume that either 
{Wfy l wf K for all a e &™ (in which case we do nothing) or {W^wf e K for 
all a e In the latter case we have W. = W*R a with some R a e K and we can 
write 

S i (a,a) = W i s (a)f fcj (a)(f fc .(a)- 1 il%(a))(a)R i (a) ) 3e^. 



Doing this for all pairs i < j we obtain the requested decomposition with the set { T k } 
consisting of all products Wff k .(d) that occur above. □ 

Corollary 5.31. Every finite subset of F® w is wlog contained in a good subgroup of 

Proof. Since F is a countable Noetherian group it has at most countably many sub- 
groups. Moreover each (^)™ is countable and there are only countably many finite 
tuples in any countable set. Hence we can use Proposition 15.301 to obtain a count- 
able descending chain of sub-IP-rings such that the decomposition property holds for 
each tuple for one of these sub-IP-rings. The required sub-IP-ring is then obtained by a 
diagonal procedure, cf. [FK85 ( Lemma 1.4]. □ 

Thus the good group is not really relevant for our multiple recurrence theorem that 
we can now formulate as follows. 

Theorem 5.32. Let Gbea nilpotent group and F < P o (^0> G.) an FVIP group. Consider a 
right measure-preserving action of G on an arbitrary (not necessarily regular) probability 
space (X, j4 , [i). Let S ,...,S t 6 F® m be arbitrary polynomial expressions and Ae j4 
with n(A) > 0. Then there exists a sub-IP-ring SF' c & such that 

IP-lim a TnLnAS/a)" 1 ) > 0. 

Proof. We can assume S Q = 1 G . By Corollary |5.31l we may assume that S ,...,S t e FE 
for some good subgroup FE < P o (^"0> G.) 8 ™. Then we can replace G by a countable 
group that is generated by the union of ranges of elements of FE. Afterward we can 
replace j4 by a separable G-invariant cr-algebra generated by A. Finally we can assume 
that X is regular and apply Theorem l5.26l □ 

Theorem[TTT]follows from Theorem [5T32l and Lemma [4^221 with the filtration J2T271 ), 
d being the maximal degree of polynomials p ; ■. By the Furstenberg correspondence 
principle we obtain the following combinatorial corollary 
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Corollary 5.33. Let G be a finitely generated nilpotent group, T 1 ,...,T t e G and p, ; e 
1\X\, ■ ■ ■ ,X m ]j i = 1, . . ., t, j = 1, . . . ,s, be polynomials that vanish at 0. Then for every 
subset E c G with positive upper Banach density the set 

t 

{n<=Z m :3geG:gY\T* itm &E ,j = l,...,s} 

i = l 

is IP* in Z m . 
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